Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks

Authors

  • Feng Huang
  • Tan Lee
Abstract

In this paper, we present a study on robust pitch estimation that integrates spectral and temporal information in speech. Spectrum harmonics are important representations of the speech fundamental frequency. Harmonic-related spectral peaks of speech evolve much more slowly than the spectral peaks of noise. This motivates the temporally accumulated peak spectrum (TAPS), which is computed by accumulating spectrum peaks over consecutive analysis frames. In the TAPS, harmonic-related peaks are concentrated around the fundamental frequency and its multiples, while the peaks caused by noise are irregularly distributed and have relatively small amplitudes. A pitch estimation method is derived from the TAPS. The peak locations in the autocorrelation of the TAPS indicate the frequency separations between the harmonic peaks, which are used to estimate the fundamental frequency. The proposed method is evaluated on speech signals corrupted by white noise, speech noise and babble noise. The pitch estimation results show that our method performs more robustly and reliably than conventional time-domain and cepstrum-domain methods.
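The pipeline described in the abstract (per-frame peak picking, accumulation over frames, autocorrelation along the frequency axis) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names (`peak_spectrum`, `taps`, `estimate_f0`), the frame length, hop, FFT size, the 2 kHz band limit, the 60 to 400 Hz search range, the mean removal and the light smoothing of the accumulated spectrum are all assumptions chosen for illustration, and the sketch makes no voiced/unvoiced decision.

```python
import numpy as np


def peak_spectrum(frame, n_fft):
    """Magnitude spectrum of one frame with all non-peak bins set to zero."""
    mag = np.abs(np.fft.rfft(frame * np.hanning(len(frame)), n_fft))
    # A bin counts as a peak if it is larger than both of its neighbours.
    idx = np.flatnonzero((mag[1:-1] > mag[:-2]) & (mag[1:-1] > mag[2:])) + 1
    peaks = np.zeros_like(mag)
    peaks[idx] = mag[idx]
    return peaks


def taps(signal, fs, frame_s=0.04, hop_s=0.01, n_fft=4096, f_max=2000.0):
    """Sum the peak spectra of consecutive frames (illustrative TAPS).

    Frame length, hop, FFT size and band limit are assumed values.
    """
    n, hop = int(frame_s * fs), int(hop_s * fs)
    acc = np.zeros(n_fft // 2 + 1)
    for start in range(0, len(signal) - n + 1, hop):
        acc += peak_spectrum(signal[start:start + n], n_fft)
    # Keep only the band where speech harmonics are expected to be strong.
    return acc[: int(f_max * n_fft / fs) + 1]


def estimate_f0(signal, fs, f_lo=60.0, f_hi=400.0, n_fft=4096):
    """Estimate F0 from the strongest autocorrelation lag of the TAPS."""
    spec = taps(signal, fs, n_fft=n_fft)
    # Assumption: light smoothing so that harmonic peaks which land one bin
    # apart in different places still overlap in the autocorrelation.
    spec = np.convolve(spec, np.hanning(7), mode="same")
    spec = spec - spec.mean()
    # Autocorrelation along the frequency axis, non-negative lags only.
    ac = np.correlate(spec, spec, mode="full")[len(spec) - 1:]
    bin_hz = fs / n_fft
    lo = int(np.ceil(f_lo / bin_hz))
    hi = int(np.floor(f_hi / bin_hz))
    lag = lo + int(np.argmax(ac[lo:hi + 1]))
    # A lag in bins is a frequency separation; convert it to hertz.
    return lag * bin_hz
```

Calling `estimate_f0(x, fs)` on a NumPy array `x` sampled at `fs` Hz returns one F0 value for the whole excerpt. The resolution is limited to one FFT bin (`fs / n_fft` Hz), so a practical system would likely interpolate around the autocorrelation peak and process the signal in short blocks to track pitch over time.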

Similar resources

Multi-band summary correlogram-based pitch detection for noisy speech

A multi-band summary correlogram (MBSC)-based pitch detection algorithm (PDA) is proposed. The PDA performs pitch estimation and voiced/unvoiced (V/UV) detection via novel signal processing schemes that are designed to enhance the MBSC’s peaks at the most likely pitch period. These peak-enhancement schemes include comb-filter channel-weighting to yield each individual subband’s summary correlog...

Noise Suppressor using Zero Phase Signal and Accumulated Spectrum Technique

This paper proposes a wide-band noise reduction method using a temporally accumulated spectrum and the zero phase (ZP) signal. In our previous study, we replaced the ZP signal around the origin with the ZP signal in the second or a later period to obtain an estimated speech ZP signal. In very low SNR environments, reliable period estimation is difficult. This paper presents a study of period estimation in n...

Noisy speech enhancement based on long term harmonic model to improve speech intelligibility for hearing impaired listeners

This study proposes a speech enhancement algorithm to improve speech intelligibility for hearing impaired listeners in adverse conditions. The proposed algorithm is based on a long term harmonic model, where the harmonics of target speech are more distinguished from noise spectrum interference. Our method consists of two stages: i) Prominent pitch estimation based on long term harmonic feature ...

DOA Estimation with Local-Peak-Weighted CSP

This paper proposes a novel weighting algorithm for Cross-power Spectrum Phase (CSP) analysis to improve the accuracy of direction of arrival (DOA) estimation for beamforming in a noisy environment. Our sound source is a human speaker and the noise is broadband noise in an automobile. The harmonic structures in the human speech spectrum can be used for weighting the CSP analysis, because harmon...

Pitch estimation of noisy speech signals using empirical mode decomposition

This paper presents a pitch estimation method for noisy speech signals using empirical mode decomposition (EMD). The normalized autocorrelation function (NACF) of the noisy speech signal is decomposed into a finite set of band-limited signals, termed intrinsic mode functions (IMFs), using EMD. The periodicity of one IMF is assumed to equal the true pitch period. A conventional autocor...

Publication date: 2010